Pronunciation variation modelling using decision tree induction from multiple linguistic parameters

نویسنده

Per-Anders Jande

چکیده

In this paper, resources and methods for annotating speech databases with various types of linguistic information are discussed. The decision tree paradigm is explored for pronunciation variation modelling using multiple linguistic context parameters derived from the annotation. Preliminary results suggest that decision tree induction is a suitable paradigm for the task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modelling Pronunciation in Discourse Context

Abstract This paper describes a method for modelling phone-level pronunciation in discourse context. Spoken language is annotated with linguistic and related information in several layers. The annotation serves as a description of the discourse context and is used as training data for decision tree model induction. In a cross validation experiment, the decision tree pronunciation models are sho...

متن کامل

Spoken language annotation and data-driven modelling of phone-level pronunciation in discourse context

A detailed description of the discourse context of a word can be used for predicting word pronunciation in discourse context and also enables studies of the interplay between various types of information on e.g. phone-level pronunciation. The work presented in this paper is aimed at modelling systematic variation in the phone-level realisation of words inherent to a language variety. A data-dri...

متن کامل

Annotating Speech Data for Pronunciation Variation Modelling

This paper describes methods for annotating recorded speech with information hypothesised to be important for the pronunciation of words in discourse context. Annotation is structured into six hierarchically ordered tiers, each tier corresponding to a segmentally defined linguistic unit. Automatic methods are used to segment and annotate the respective annotation tiers. Decision tree models tra...

متن کامل

Inducing decision tree pronunciation variation models from annotated speech data

A model of pronunciation of words in discourse context has been induced from the annotation of a spoken language corpus. The information included in the annotation is a set of variables hypothesised to be important for the pronunciation of words in discourse context. The annotation is connected to segmentally defined units on tiers corresponding to linguistically relevant units: the discourse, ...

متن کامل

Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition

A method of modelling accent-specific pronunciation variations is presented. Speech from an unseen accent group is phonetically transcribed such that pronunciation variations may be derived. These context-dependent variations are clustered in decision trees which are used as a model of the pronunciation variation associated with this new accent group. The trees are then used to build a new pron...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2004

Pronunciation variation modelling using decision tree induction from multiple linguistic parameters

نویسنده

چکیده

منابع مشابه

Modelling Pronunciation in Discourse Context

Spoken language annotation and data-driven modelling of phone-level pronunciation in discourse context

Annotating Speech Data for Pronunciation Variation Modelling

Inducing decision tree pronunciation variation models from annotated speech data

Using accent-specific pronunciation modelling for improved large vocabulary continuous speech recognition

عنوان ژورنال:

اشتراک گذاری